Tradeoff in Rule Induction for Semantic Query Optimization
نویسنده
چکیده
Semantic query optimization (SQO) is a promising approach to the optimization of increasingly complex query plans in global information systems. The idea of SQO is to use semantic rules about data to reformulate a query into an equivalent but less expensive one. Since it is difficult to encode required semantic rules, a complete SQO system also includes a rule induction system and a rule maintainer. To maximize the net utility of learning, a rule induction system needs to learn those rules that are effective in reducing the query execution cost while robust against data changes to minimize the rule maintenance cost. This paper focuses on this tradeoff between effectiveness and robustness in the rule induction for SQO. The solution is to explicitly estimate the degree of the robustness of rules. The system can use the estimated robustness to make decisions to guide rule construction, guide rule repair, and control the size of a rule set. This paper also briefly reviews how robustness can be efficiently estimated and reports the initial experimental results. Learning for Semantic Query Optimization Semantic query optimization (SQO) (King 1981) promising query optimization technique for intelligent information mediators (Wiederhold 1992; Arens et al. 1993; Levy, Srivastava, & Kirk 1995) that integrate heterogeneous information sources because it can complement conventional query optimization techniques to overcome the heterogeneity and considerably reduce query execution cost. The essential idea of semantic query optimization is to use semantic rules about data to reformulate a query into a more efficient but semantically equivalent query. Example 1: Suppose we have a query that retrieves the ship classes and the maximal draft of the ships in those classes which satisfy the following conditions: the ships in the class are capable of carrying containers, and their draft is less than 50 feet. This query is specified as follows: query (?ship_class, ?draft) : 1 : ship_class (?ship_class, _, ?draft, _, ?ctnr), 2: ship (_, ?ship_class ,_, _, ?status), 3: ?ctnr ---"Y", 4: ?status ---"active", S: ?draft < BO. In addition to the query, suppose the system possesses a set of semantic rules. Among them two rules are applicable to this query: RI: IF ship(_,?class,?status,_,_) ship_class (?class, _, ?draft, _,_) & ?draft < 50 THEN ?status = "active" R2: IF ship_class(?class .......?ctnr) ?ctnr = "Y" THEN ship (_,?class ...... ) R1 states that If the mamimum draft of a ship is less than 50, then its status is active. R2 states that If a ship class has container capability, then there must emist some ships that belong to that ship class in the
منابع مشابه
Relevant Rule Derivation for Semantic Query Optimization
Semantic query optimization in database systems has many advantages over the conventional query optimization. The success of semantic query optimization will depend on the set of relevant semantic rules available for semantic query optimizer. The semantic query optimizer utilizes a set of available semantic rules to further explore extra query optimization plans for conventional query optimizer...
متن کاملTradeo in Rule Induction for Semantic Query Optimization
Semantic query optimization (SQO) is a promising approach to the optimization of increasingly complex query plans in global information systems. The idea of SQO is to use semantic rules about data to reformu-late a query into an equivalent but less expensive one. Since it is diicult to encode required semantic rules, a complete SQO system also includes a rule induction system and a rule maintai...
متن کاملRule Induction for Semantic Query Optimization
Semantic query optimization can dramatically speed up database query answering by knowledge intensive reformulation. But the problem of how to learn required semantic rules has not previously been solved. This paper describes an approach using an inductive learning algorithm to solve the problem. In our approach, learning is triggered by user queries and then the system induces semantic rules f...
متن کاملAnalysis of User query refinement behavior based on semantic features: user log analysis of Ganj database (IranDoc)
Background and Aim: Information systems cannot be well designed or developed without a clear understanding of needs of users, manner of their information seeking and evaluating. This research has been designed to analyze the Ganj (Iranian research institute of science and technology database) users’ query refinement behaviors via log analysis. Methods: The method of this research is log anal...
متن کاملQuery Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002